Overview
Brought to you by YData
Dataset statistics
| Number of variables | 16 |
|---|---|
| Number of observations | 558837 |
| Missing cells | 123494 |
| Missing cells (%) | 1.4% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 371.7 MiB |
| Average record size in memory | 697.4 B |
Variable types
| Numeric | 5 |
|---|---|
| Text | 8 |
| Categorical | 3 |
color is highly overall correlated with transmission | High correlation |
mmr is highly overall correlated with odometer and 2 other fields | High correlation |
odometer is highly overall correlated with mmr and 2 other fields | High correlation |
sellingprice is highly overall correlated with mmr and 2 other fields | High correlation |
transmission is highly overall correlated with color | High correlation |
year is highly overall correlated with mmr and 2 other fields | High correlation |
transmission is highly imbalanced (90.1%) | Imbalance |
interior is highly imbalanced (50.4%) | Imbalance |
make has 10301 (1.8%) missing values | Missing |
model has 10399 (1.9%) missing values | Missing |
trim has 10715 (1.9%) missing values | Missing |
body has 13195 (2.4%) missing values | Missing |
transmission has 65321 (11.7%) missing values | Missing |
condition has 11820 (2.1%) missing values | Missing |
Reproduction
| Analysis started | 2025-09-29 15:59:38.469931 |
|---|---|
| Analysis finished | 2025-09-29 16:00:20.765257 |
| Duration | 42.3 seconds |
| Software version | ydata-profiling vv4.17.0 |
| Download configuration | config.json |
Variables
year
Real number (ℝ)
High correlation
| Distinct | 34 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2010.0389 |
| Minimum | 1982 |
|---|---|
| Maximum | 2015 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.3 MiB |
Quantile statistics
| Minimum | 1982 |
|---|---|
| 5-th percentile | 2002 |
| Q1 | 2007 |
| median | 2012 |
| Q3 | 2013 |
| 95-th percentile | 2014 |
| Maximum | 2015 |
| Range | 33 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.9668636 |
|---|---|
| Coefficient of variation (CV) | 0.0019735258 |
| Kurtosis | 1.0105063 |
| Mean | 2010.0389 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -1.1832259 |
| Sum | 1.1232841 × 109 |
| Variance | 15.736007 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2012 | 102315 | |
| 2013 | 98168 | |
| 2014 | 81070 | |
| 2011 | 48548 | |
| 2008 | 31502 | 5.6% |
| 2007 | 30845 | 5.5% |
| 2006 | 26913 | 4.8% |
| 2010 | 26485 | 4.7% |
| 2005 | 21394 | 3.8% |
| 2009 | 20594 | 3.7% |
| Other values (24) | 71003 |
| Value | Count | Frequency (%) |
| 1982 | 2 | < 0.1% |
| 1983 | 1 | < 0.1% |
| 1984 | 5 | < 0.1% |
| 1985 | 10 | < 0.1% |
| 1986 | 11 | < 0.1% |
| 1987 | 8 | < 0.1% |
| 1988 | 11 | < 0.1% |
| 1989 | 20 | < 0.1% |
| 1990 | 49 | |
| 1991 | 67 |
| Value | Count | Frequency (%) |
| 2015 | 9437 | 1.7% |
| 2014 | 81070 | |
| 2013 | 98168 | |
| 2012 | 102315 | |
| 2011 | 48548 | |
| 2010 | 26485 | 4.7% |
| 2009 | 20594 | 3.7% |
| 2008 | 31502 | 5.6% |
| 2007 | 30845 | 5.5% |
| 2006 | 26913 | 4.8% |
make
Text
Missing
| Distinct | 96 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 10301 |
| Missing (%) | 1.8% |
| Memory size | 29.1 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 11 |
| Mean length | 5.9952236 |
| Min length | 2 |
Unique
| Unique | 8 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Kia |
|---|---|
| 2nd row | Kia |
| 3rd row | BMW |
| 4th row | Volvo |
| 5th row | BMW |
| Value | Count | Frequency (%) |
| ford | 94001 | |
| chevrolet | 60587 | 11.0% |
| nissan | 54017 | 9.8% |
| toyota | 39966 | 7.3% |
| dodge | 30956 | 5.6% |
| honda | 27351 | 5.0% |
| hyundai | 21837 | 4.0% |
| bmw | 20793 | 3.8% |
| kia | 18084 | 3.3% |
| chrysler | 17485 | 3.2% |
| Other values (54) | 165367 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 328819 | 10.0% |
| e | 300106 | 9.1% |
| a | 235580 | 7.2% |
| r | 230200 | 7.0% |
| d | 215895 | 6.6% |
| n | 186317 | 5.7% |
| i | 184908 | 5.6% |
| s | 178494 | 5.4% |
| t | 128322 | 3.9% |
| l | 116781 | 3.6% |
| Other values (39) | 1183174 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3288596 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 328819 | 10.0% |
| e | 300106 | 9.1% |
| a | 235580 | 7.2% |
| r | 230200 | 7.0% |
| d | 215895 | 6.6% |
| n | 186317 | 5.7% |
| i | 184908 | 5.6% |
| s | 178494 | 5.4% |
| t | 128322 | 3.9% |
| l | 116781 | 3.6% |
| Other values (39) | 1183174 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3288596 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 328819 | 10.0% |
| e | 300106 | 9.1% |
| a | 235580 | 7.2% |
| r | 230200 | 7.0% |
| d | 215895 | 6.6% |
| n | 186317 | 5.7% |
| i | 184908 | 5.6% |
| s | 178494 | 5.4% |
| t | 128322 | 3.9% |
| l | 116781 | 3.6% |
| Other values (39) | 1183174 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3288596 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 328819 | 10.0% |
| e | 300106 | 9.1% |
| a | 235580 | 7.2% |
| r | 230200 | 7.0% |
| d | 215895 | 6.6% |
| n | 186317 | 5.7% |
| i | 184908 | 5.6% |
| s | 178494 | 5.4% |
| t | 128322 | 3.9% |
| l | 116781 | 3.6% |
| Other values (39) | 1183174 |
model
Text
Missing
| Distinct | 973 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 10399 |
| Missing (%) | 1.9% |
| Memory size | 29.5 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 23 |
| Mean length | 6.7691243 |
| Min length | 1 |
Unique
| Unique | 72 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Sorento |
|---|---|
| 2nd row | Sorento |
| 3rd row | 3 Series |
| 4th row | S60 |
| 5th row | 6 Series Gran Coupe |
| Value | Count | Frequency (%) |
| altima | 19432 | 2.9% |
| series | 15429 | 2.3% |
| grand | 14928 | 2.2% |
| f-150 | 14527 | 2.2% |
| 1500 | 14476 | 2.2% |
| fusion | 13639 | 2.0% |
| camry | 13515 | 2.0% |
| escape | 12027 | 1.8% |
| focus | 10463 | 1.6% |
| g | 9333 | 1.4% |
| Other values (740) | 531345 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 376585 | 10.1% |
| r | 277169 | 7.5% |
| e | 269750 | 7.3% |
| o | 195221 | 5.3% |
| n | 184979 | 5.0% |
| i | 170339 | 4.6% |
| s | 149945 | 4.0% |
| t | 136207 | 3.7% |
| l | 132705 | 3.6% |
| C | 123260 | 3.3% |
| Other values (56) | 1696285 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3712445 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 376585 | 10.1% |
| r | 277169 | 7.5% |
| e | 269750 | 7.3% |
| o | 195221 | 5.3% |
| n | 184979 | 5.0% |
| i | 170339 | 4.6% |
| s | 149945 | 4.0% |
| t | 136207 | 3.7% |
| l | 132705 | 3.6% |
| C | 123260 | 3.3% |
| Other values (56) | 1696285 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3712445 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 376585 | 10.1% |
| r | 277169 | 7.5% |
| e | 269750 | 7.3% |
| o | 195221 | 5.3% |
| n | 184979 | 5.0% |
| i | 170339 | 4.6% |
| s | 149945 | 4.0% |
| t | 136207 | 3.7% |
| l | 132705 | 3.6% |
| C | 123260 | 3.3% |
| Other values (56) | 1696285 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3712445 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 376585 | 10.1% |
| r | 277169 | 7.5% |
| e | 269750 | 7.3% |
| o | 195221 | 5.3% |
| n | 184979 | 5.0% |
| i | 170339 | 4.6% |
| s | 149945 | 4.0% |
| t | 136207 | 3.7% |
| l | 132705 | 3.6% |
| C | 123260 | 3.3% |
| Other values (56) | 1696285 |
trim
Text
Missing
| Distinct | 1963 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 10715 |
| Missing (%) | 1.9% |
| Memory size | 28.4 MiB |
Length
| Max length | 46 |
|---|---|
| Median length | 37 |
| Mean length | 4.7365805 |
| Min length | 1 |
Unique
| Unique | 241 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | LX |
|---|---|
| 2nd row | LX |
| 3rd row | 328i SULEV |
| 4th row | T5 |
| 5th row | 650i |
| Value | Count | Frequency (%) |
| base | 56122 | 8.3% |
| se | 48390 | 7.2% |
| s | 30312 | 4.5% |
| lx | 21376 | 3.2% |
| limited | 20582 | 3.1% |
| lt | 20224 | 3.0% |
| 2.5 | 18864 | 2.8% |
| xlt | 18796 | 2.8% |
| ls | 17932 | 2.7% |
| sport | 17602 | 2.6% |
| Other values (963) | 402154 |
Most occurring characters
| Value | Count | Frequency (%) |
| L | 214993 | 8.3% |
| S | 206476 | 8.0% |
| e | 155206 | 6.0% |
| i | 135234 | 5.2% |
| E | 127024 | 4.9% |
| 124233 | 4.8% | |
| T | 120883 | 4.7% |
| a | 108838 | 4.2% |
| r | 97787 | 3.8% |
| X | 91515 | 3.5% |
| Other values (62) | 1214035 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2596224 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| L | 214993 | 8.3% |
| S | 206476 | 8.0% |
| e | 155206 | 6.0% |
| i | 135234 | 5.2% |
| E | 127024 | 4.9% |
| 124233 | 4.8% | |
| T | 120883 | 4.7% |
| a | 108838 | 4.2% |
| r | 97787 | 3.8% |
| X | 91515 | 3.5% |
| Other values (62) | 1214035 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2596224 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| L | 214993 | 8.3% |
| S | 206476 | 8.0% |
| e | 155206 | 6.0% |
| i | 135234 | 5.2% |
| E | 127024 | 4.9% |
| 124233 | 4.8% | |
| T | 120883 | 4.7% |
| a | 108838 | 4.2% |
| r | 97787 | 3.8% |
| X | 91515 | 3.5% |
| Other values (62) | 1214035 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2596224 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| L | 214993 | 8.3% |
| S | 206476 | 8.0% |
| e | 155206 | 6.0% |
| i | 135234 | 5.2% |
| E | 127024 | 4.9% |
| 124233 | 4.8% | |
| T | 120883 | 4.7% |
| a | 108838 | 4.2% |
| r | 97787 | 3.8% |
| X | 91515 | 3.5% |
| Other values (62) | 1214035 |
body
Text
Missing
| Distinct | 87 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 13195 |
| Missing (%) | 2.4% |
| Memory size | 28.6 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 5 |
| Mean length | 5.2792729 |
| Min length | 3 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | SUV |
|---|---|
| 2nd row | SUV |
| 3rd row | Sedan |
| 4th row | Sedan |
| 5th row | Sedan |
| Value | Count | Frequency (%) |
| sedan | 248760 | |
| suv | 143844 | |
| cab | 33137 | 5.6% |
| hatchback | 26237 | 4.4% |
| minivan | 25529 | 4.3% |
| coupe | 19983 | 3.4% |
| crew | 16394 | 2.8% |
| wagon | 16180 | 2.7% |
| convertible | 10933 | 1.9% |
| g | 9333 | 1.6% |
| Other values (33) | 40608 | 6.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 398031 | |
| e | 352374 | |
| n | 338855 | |
| S | 338282 | |
| d | 262219 | 9.1% |
| V | 124796 | 4.3% |
| U | 119292 | 4.1% |
| C | 78559 | 2.7% |
| b | 77463 | 2.7% |
| s | 73869 | 2.6% |
| Other values (38) | 716853 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2880593 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 398031 | |
| e | 352374 | |
| n | 338855 | |
| S | 338282 | |
| d | 262219 | 9.1% |
| V | 124796 | 4.3% |
| U | 119292 | 4.1% |
| C | 78559 | 2.7% |
| b | 77463 | 2.7% |
| s | 73869 | 2.6% |
| Other values (38) | 716853 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2880593 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 398031 | |
| e | 352374 | |
| n | 338855 | |
| S | 338282 | |
| d | 262219 | 9.1% |
| V | 124796 | 4.3% |
| U | 119292 | 4.1% |
| C | 78559 | 2.7% |
| b | 77463 | 2.7% |
| s | 73869 | 2.6% |
| Other values (38) | 716853 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2880593 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 398031 | |
| e | 352374 | |
| n | 338855 | |
| S | 338282 | |
| d | 262219 | 9.1% |
| V | 124796 | 4.3% |
| U | 119292 | 4.1% |
| C | 78559 | 2.7% |
| b | 77463 | 2.7% |
| s | 73869 | 2.6% |
| Other values (38) | 716853 |
transmission
Categorical
High correlation Imbalance Missing
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 65321 |
| Missing (%) | 11.7% |
| Memory size | 30.7 MiB |
| automatic | |
|---|---|
| manual | 17539 |
| horse-driven | 274 |
| sedan | 15 |
| Sedan | 11 |
Length
| Max length | 12 |
|---|---|
| Median length | 9 |
| Mean length | 8.8948383 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | automatic |
|---|---|
| 2nd row | automatic |
| 3rd row | automatic |
| 4th row | automatic |
| 5th row | automatic |
Common Values
| Value | Count | Frequency (%) |
| automatic | 475677 | |
| manual | 17539 | 3.1% |
| horse-driven | 274 | < 0.1% |
| sedan | 15 | < 0.1% |
| Sedan | 11 | < 0.1% |
| (Missing) | 65321 | 11.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| automatic | 475677 | |
| manual | 17539 | 3.6% |
| horse-driven | 274 | 0.1% |
| sedan | 26 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 986458 | |
| t | 951354 | |
| u | 493216 | |
| m | 493216 | |
| o | 475951 | |
| i | 475951 | |
| c | 475677 | |
| n | 17839 | 0.4% |
| l | 17539 | 0.4% |
| e | 574 | < 0.1% |
| Other values (7) | 1970 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4389745 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 986458 | |
| t | 951354 | |
| u | 493216 | |
| m | 493216 | |
| o | 475951 | |
| i | 475951 | |
| c | 475677 | |
| n | 17839 | 0.4% |
| l | 17539 | 0.4% |
| e | 574 | < 0.1% |
| Other values (7) | 1970 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4389745 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 986458 | |
| t | 951354 | |
| u | 493216 | |
| m | 493216 | |
| o | 475951 | |
| i | 475951 | |
| c | 475677 | |
| n | 17839 | 0.4% |
| l | 17539 | 0.4% |
| e | 574 | < 0.1% |
| Other values (7) | 1970 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4389745 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 986458 | |
| t | 951354 | |
| u | 493216 | |
| m | 493216 | |
| o | 475951 | |
| i | 475951 | |
| c | 475677 | |
| n | 17839 | 0.4% |
| l | 17539 | 0.4% |
| e | 574 | < 0.1% |
| Other values (7) | 1970 | < 0.1% |
vin
Text
| Distinct | 550297 |
|---|---|
| Distinct (%) | 98.5% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Memory size | 35.2 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 17 |
| Mean length | 16.999585 |
| Min length | 3 |
Unique
| Unique | 541970 ? |
|---|---|
| Unique (%) | 97.0% |
Sample
| 1st row | 5xyktca69fg566472 |
|---|---|
| 2nd row | 5xyktca69fg561319 |
| 3rd row | wba3c1c51ek116351 |
| 4th row | yv1612tb4f1310987 |
| 5th row | wba6b2c57ed129731 |
| Value | Count | Frequency (%) |
| automatic | 22 | < 0.1% |
| wbanv13588cz57827 | 5 | < 0.1% |
| 1ftfw1cv5afb30053 | 4 | < 0.1% |
| 5uxfe43579l274932 | 4 | < 0.1% |
| 5n1ar1nn2bc632869 | 4 | < 0.1% |
| wddgf56x78f009940 | 4 | < 0.1% |
| trusc28n241022003 | 4 | < 0.1% |
| wp0ca2988xu629622 | 4 | < 0.1% |
| wbxpa93416wd25282 | 3 | < 0.1% |
| yv1mc67288j052897 | 3 | < 0.1% |
| Other values (550287) | 558776 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 919854 | 9.7% |
| 2 | 636863 | 6.7% |
| 3 | 612467 | 6.4% |
| 5 | 595556 | 6.3% |
| 4 | 574940 | 6.1% |
| 0 | 498895 | 5.3% |
| 6 | 487519 | 5.1% |
| 7 | 458863 | 4.8% |
| 8 | 455044 | 4.8% |
| c | 381530 | 4.0% |
| Other values (26) | 3878398 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 9499929 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 919854 | 9.7% |
| 2 | 636863 | 6.7% |
| 3 | 612467 | 6.4% |
| 5 | 595556 | 6.3% |
| 4 | 574940 | 6.1% |
| 0 | 498895 | 5.3% |
| 6 | 487519 | 5.1% |
| 7 | 458863 | 4.8% |
| 8 | 455044 | 4.8% |
| c | 381530 | 4.0% |
| Other values (26) | 3878398 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 9499929 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 919854 | 9.7% |
| 2 | 636863 | 6.7% |
| 3 | 612467 | 6.4% |
| 5 | 595556 | 6.3% |
| 4 | 574940 | 6.1% |
| 0 | 498895 | 5.3% |
| 6 | 487519 | 5.1% |
| 7 | 458863 | 4.8% |
| 8 | 455044 | 4.8% |
| c | 381530 | 4.0% |
| Other values (26) | 3878398 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 9499929 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 919854 | 9.7% |
| 2 | 636863 | 6.7% |
| 3 | 612467 | 6.4% |
| 5 | 595556 | 6.3% |
| 4 | 574940 | 6.1% |
| 0 | 498895 | 5.3% |
| 6 | 487519 | 5.1% |
| 7 | 458863 | 4.8% |
| 8 | 455044 | 4.8% |
| c | 381530 | 4.0% |
| Other values (26) | 3878398 |
state
Text
| Distinct | 64 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 27.2 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 2 |
| Mean length | 2.0006979 |
| Min length | 2 |
Unique
| Unique | 26 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | ca |
|---|---|
| 2nd row | ca |
| 3rd row | ca |
| 4th row | ca |
| 5th row | ca |
| Value | Count | Frequency (%) |
| fl | 82945 | |
| ca | 73148 | |
| pa | 53907 | 9.6% |
| tx | 45913 | 8.2% |
| ga | 34750 | 6.2% |
| nj | 27784 | 5.0% |
| il | 23486 | 4.2% |
| nc | 21845 | 3.9% |
| oh | 21575 | 3.9% |
| tn | 20895 | 3.7% |
| Other values (54) | 152589 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 199889 | |
| n | 110349 | |
| l | 108648 | |
| c | 108264 | |
| f | 82971 | 7.4% |
| t | 68644 | 6.1% |
| m | 60888 | 5.4% |
| p | 56632 | 5.1% |
| i | 54410 | 4.9% |
| o | 50032 | 4.5% |
| Other values (26) | 217337 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1118064 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 199889 | |
| n | 110349 | |
| l | 108648 | |
| c | 108264 | |
| f | 82971 | 7.4% |
| t | 68644 | 6.1% |
| m | 60888 | 5.4% |
| p | 56632 | 5.1% |
| i | 54410 | 4.9% |
| o | 50032 | 4.5% |
| Other values (26) | 217337 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1118064 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 199889 | |
| n | 110349 | |
| l | 108648 | |
| c | 108264 | |
| f | 82971 | 7.4% |
| t | 68644 | 6.1% |
| m | 60888 | 5.4% |
| p | 56632 | 5.1% |
| i | 54410 | 4.9% |
| o | 50032 | 4.5% |
| Other values (26) | 217337 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1118064 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 199889 | |
| n | 110349 | |
| l | 108648 | |
| c | 108264 | |
| f | 82971 | 7.4% |
| t | 68644 | 6.1% |
| m | 60888 | 5.4% |
| p | 56632 | 5.1% |
| i | 54410 | 4.9% |
| o | 50032 | 4.5% |
| Other values (26) | 217337 |
condition
Real number (ℝ)
Missing
| Distinct | 49 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 11820 |
| Missing (%) | 2.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30.680052 |
| Minimum | 0 |
|---|---|
| Maximum | 982 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 23 |
| median | 35 |
| Q3 | 42 |
| 95-th percentile | 47 |
| Maximum | 982 |
| Range | 982 |
| Interquartile range (IQR) | 19 |
Descriptive statistics
| Standard deviation | 13.585974 |
|---|---|
| Coefficient of variation (CV) | 0.4428276 |
| Kurtosis | 99.271038 |
| Mean | 30.680052 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.77213291 |
| Sum | 16782510 |
| Variance | 184.57868 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 19 | 42280 | 7.6% |
| 35 | 26749 | 4.8% |
| 37 | 25938 | 4.6% |
| 44 | 25514 | 4.6% |
| 43 | 24937 | 4.5% |
| 42 | 24328 | 4.4% |
| 36 | 23144 | 4.1% |
| 41 | 23073 | 4.1% |
| 2 | 20788 | 3.7% |
| 4 | 19922 | 3.6% |
| Other values (39) | 290344 |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 1 | 7363 | 1.3% |
| 2 | 20788 | |
| 3 | 10802 | |
| 4 | 19922 | |
| 5 | 11222 | |
| 11 | 87 | < 0.1% |
| 12 | 95 | < 0.1% |
| 13 | 82 | < 0.1% |
| 14 | 134 | < 0.1% |
| Value | Count | Frequency (%) |
| 982 | 1 | < 0.1% |
| 897 | 1 | < 0.1% |
| 849 | 1 | < 0.1% |
| 313 | 2 | < 0.1% |
| 289 | 2 | < 0.1% |
| 280 | 1 | < 0.1% |
| 193 | 1 | < 0.1% |
| 49 | 13099 | |
| 48 | 12712 | |
| 47 | 11362 |
odometer
Real number (ℝ)
High correlation
| Distinct | 172276 |
|---|---|
| Distinct (%) | 30.8% |
| Missing | 94 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 68311.538 |
| Minimum | 0 |
|---|---|
| Maximum | 999999 |
| Zeros | 152 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 10491.1 |
| Q1 | 28359 |
| median | 52247 |
| Q3 | 99108.5 |
| 95-th percentile | 170056.9 |
| Maximum | 999999 |
| Range | 999999 |
| Interquartile range (IQR) | 70749.5 |
Descriptive statistics
| Standard deviation | 53405.247 |
|---|---|
| Coefficient of variation (CV) | 0.78178956 |
| Kurtosis | 13.542316 |
| Mean | 68311.538 |
| Median Absolute Deviation (MAD) | 30480 |
| Skewness | 1.8426065 |
| Sum | 3.8168594 × 1010 |
| Variance | 2.8521204 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1318 | 0.2% |
| 0 | 152 | < 0.1% |
| 999999 | 72 | < 0.1% |
| 10 | 29 | < 0.1% |
| 21587 | 21 | < 0.1% |
| 2 | 18 | < 0.1% |
| 29137 | 18 | < 0.1% |
| 8 | 18 | < 0.1% |
| 21310 | 18 | < 0.1% |
| 36007 | 17 | < 0.1% |
| Other values (172266) | 557062 | |
| (Missing) | 94 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 152 | < 0.1% |
| 1 | 1318 | |
| 2 | 18 | < 0.1% |
| 3 | 9 | < 0.1% |
| 4 | 9 | < 0.1% |
| 5 | 17 | < 0.1% |
| 6 | 13 | < 0.1% |
| 7 | 13 | < 0.1% |
| 8 | 18 | < 0.1% |
| 9 | 11 | < 0.1% |
| Value | Count | Frequency (%) |
| 999999 | 72 | |
| 980113 | 1 | < 0.1% |
| 959276 | 1 | < 0.1% |
| 694978 | 2 | < 0.1% |
| 621388 | 1 | < 0.1% |
| 580956 | 1 | < 0.1% |
| 537334 | 1 | < 0.1% |
| 522212 | 1 | < 0.1% |
| 500227 | 1 | < 0.1% |
| 495757 | 1 | < 0.1% |
color
Categorical
High correlation
| Distinct | 46 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 749 |
| Missing (%) | 0.1% |
| Memory size | 29.1 MiB |
| black | |
|---|---|
| white | |
| silver | |
| gray | |
| blue | |
| Other values (41) |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 4.6275032 |
| Min length | 1 |
Unique
| Unique | 26 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | white |
|---|---|
| 2nd row | white |
| 3rd row | gray |
| 4th row | white |
| 5th row | gray |
Common Values
| Value | Count | Frequency (%) |
| black | 110970 | |
| white | 106673 | |
| silver | 83389 | |
| gray | 82857 | |
| blue | 51139 | |
| red | 43569 | 7.8% |
| — | 24685 | 4.4% |
| green | 11382 | 2.0% |
| gold | 11342 | 2.0% |
| beige | 9222 | 1.7% |
| Other values (36) | 22860 | 4.1% |
Length
| Value | Count | Frequency (%) |
| black | 110970 | |
| white | 106673 | |
| silver | 83389 | |
| gray | 82857 | |
| blue | 51139 | |
| red | 43569 | 7.8% |
| — | 24685 | 4.4% |
| green | 11382 | 2.0% |
| gold | 11342 | 2.0% |
| beige | 9222 | 1.7% |
| Other values (36) | 22860 | 4.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 332602 | |
| l | 261465 | 10.1% |
| r | 241240 | 9.3% |
| i | 201026 | 7.8% |
| a | 196863 | 7.6% |
| b | 187020 | 7.2% |
| g | 125853 | 4.9% |
| w | 116124 | 4.5% |
| c | 111928 | 4.3% |
| k | 111012 | 4.3% |
| Other values (25) | 697421 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2582554 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 332602 | |
| l | 261465 | 10.1% |
| r | 241240 | 9.3% |
| i | 201026 | 7.8% |
| a | 196863 | 7.6% |
| b | 187020 | 7.2% |
| g | 125853 | 4.9% |
| w | 116124 | 4.5% |
| c | 111928 | 4.3% |
| k | 111012 | 4.3% |
| Other values (25) | 697421 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2582554 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 332602 | |
| l | 261465 | 10.1% |
| r | 241240 | 9.3% |
| i | 201026 | 7.8% |
| a | 196863 | 7.6% |
| b | 187020 | 7.2% |
| g | 125853 | 4.9% |
| w | 116124 | 4.5% |
| c | 111928 | 4.3% |
| k | 111012 | 4.3% |
| Other values (25) | 697421 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2582554 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 332602 | |
| l | 261465 | 10.1% |
| r | 241240 | 9.3% |
| i | 201026 | 7.8% |
| a | 196863 | 7.6% |
| b | 187020 | 7.2% |
| g | 125853 | 4.9% |
| w | 116124 | 4.5% |
| c | 111928 | 4.3% |
| k | 111012 | 4.3% |
| Other values (25) | 697421 |
interior
Categorical
Imbalance
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 749 |
| Missing (%) | 0.1% |
| Memory size | 28.8 MiB |
| black | |
|---|---|
| gray | |
| beige | |
| tan | |
| — | 17077 |
| Other values (12) | 14250 |
Length
| Max length | 9 |
|---|---|
| Median length | 5 |
| Mean length | 4.399437 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | black |
|---|---|
| 2nd row | beige |
| 3rd row | black |
| 4th row | black |
| 5th row | black |
Common Values
| Value | Count | Frequency (%) |
| black | 244329 | |
| gray | 178581 | |
| beige | 59758 | 10.7% |
| tan | 44093 | 7.9% |
| — | 17077 | 3.1% |
| brown | 8640 | 1.5% |
| red | 1363 | 0.2% |
| blue | 1143 | 0.2% |
| silver | 1104 | 0.2% |
| off-white | 480 | 0.1% |
| Other values (7) | 1520 | 0.3% |
| (Missing) | 749 | 0.1% |
Length
| Value | Count | Frequency (%) |
| black | 244329 | |
| gray | 178581 | |
| beige | 59758 | 10.7% |
| tan | 44093 | 7.9% |
| — | 17077 | 3.1% |
| brown | 8640 | 1.5% |
| red | 1363 | 0.2% |
| blue | 1143 | 0.2% |
| silver | 1104 | 0.2% |
| off-white | 480 | 0.1% |
| Other values (7) | 1520 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 467148 | |
| b | 314061 | |
| l | 247279 | |
| c | 244329 | |
| k | 244329 | |
| g | 239244 | |
| r | 190608 | |
| y | 178792 | 7.3% |
| e | 124856 | 5.1% |
| i | 61598 | 2.5% |
| Other values (13) | 143029 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2455273 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 467148 | |
| b | 314061 | |
| l | 247279 | |
| c | 244329 | |
| k | 244329 | |
| g | 239244 | |
| r | 190608 | |
| y | 178792 | 7.3% |
| e | 124856 | 5.1% |
| i | 61598 | 2.5% |
| Other values (13) | 143029 | 5.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2455273 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 467148 | |
| b | 314061 | |
| l | 247279 | |
| c | 244329 | |
| k | 244329 | |
| g | 239244 | |
| r | 190608 | |
| y | 178792 | 7.3% |
| e | 124856 | 5.1% |
| i | 61598 | 2.5% |
| Other values (13) | 143029 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2455273 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 467148 | |
| b | 314061 | |
| l | 247279 | |
| c | 244329 | |
| k | 244329 | |
| g | 239244 | |
| r | 190608 | |
| y | 178792 | 7.3% |
| e | 124856 | 5.1% |
| i | 61598 | 2.5% |
| Other values (13) | 143029 | 5.8% |
seller
Text
| Distinct | 14264 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.4 MiB |
Length
| Max length | 50 |
|---|---|
| Median length | 42 |
| Mean length | 22.990081 |
| Min length | 3 |
Unique
| Unique | 4949 ? |
|---|---|
| Unique (%) | 0.9% |
Sample
| 1st row | kia motors america inc |
|---|---|
| 2nd row | kia motors america inc |
| 3rd row | financial services remarketing (lease) |
| 4th row | volvo na rep/world omni |
| 5th row | financial services remarketing (lease) |
| Value | Count | Frequency (%) |
| inc | 86909 | 4.6% |
| services | 48240 | 2.6% |
| corporation | 47850 | 2.5% |
| auto | 47452 | 2.5% |
| credit | 46959 | 2.5% |
| motor | 45807 | 2.4% |
| llc | 45554 | 2.4% |
| financial | 44150 | 2.3% |
| ford | 36212 | 1.9% |
| remarketing | 35475 | 1.9% |
| Other values (8581) | 1395305 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1339787 | 10.4% | |
| e | 1146331 | 8.9% |
| a | 1052355 | 8.2% |
| r | 962265 | 7.5% |
| n | 953446 | 7.4% |
| i | 917296 | 7.1% |
| o | 863200 | 6.7% |
| t | 796230 | 6.2% |
| c | 736349 | 5.7% |
| s | 671368 | 5.2% |
| Other values (37) | 3409081 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 12847708 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1339787 | 10.4% | |
| e | 1146331 | 8.9% |
| a | 1052355 | 8.2% |
| r | 962265 | 7.5% |
| n | 953446 | 7.4% |
| i | 917296 | 7.1% |
| o | 863200 | 6.7% |
| t | 796230 | 6.2% |
| c | 736349 | 5.7% |
| s | 671368 | 5.2% |
| Other values (37) | 3409081 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 12847708 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1339787 | 10.4% | |
| e | 1146331 | 8.9% |
| a | 1052355 | 8.2% |
| r | 962265 | 7.5% |
| n | 953446 | 7.4% |
| i | 917296 | 7.1% |
| o | 863200 | 6.7% |
| t | 796230 | 6.2% |
| c | 736349 | 5.7% |
| s | 671368 | 5.2% |
| Other values (37) | 3409081 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 12847708 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1339787 | 10.4% | |
| e | 1146331 | 8.9% |
| a | 1052355 | 8.2% |
| r | 962265 | 7.5% |
| n | 953446 | 7.4% |
| i | 917296 | 7.1% |
| o | 863200 | 6.7% |
| t | 796230 | 6.2% |
| c | 736349 | 5.7% |
| s | 671368 | 5.2% |
| Other values (37) | 3409081 |
mmr
Real number (ℝ)
High correlation
| Distinct | 1101 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 123 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13770.378 |
| Minimum | 25 |
|---|---|
| Maximum | 182000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.3 MiB |
Quantile statistics
| Minimum | 25 |
|---|---|
| 5-th percentile | 1800 |
| Q1 | 7100 |
| median | 12250 |
| Q3 | 18300 |
| 95-th percentile | 30600 |
| Maximum | 182000 |
| Range | 181975 |
| Interquartile range (IQR) | 11200 |
Descriptive statistics
| Standard deviation | 9680.2193 |
|---|---|
| Coefficient of variation (CV) | 0.70297414 |
| Kurtosis | 11.443099 |
| Mean | 13770.378 |
| Median Absolute Deviation (MAD) | 5575 |
| Skewness | 1.997564 |
| Sum | 7.6937027 × 109 |
| Variance | 93706645 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12500 | 1760 | 0.3% |
| 11600 | 1751 | 0.3% |
| 11650 | 1746 | 0.3% |
| 12150 | 1722 | 0.3% |
| 11850 | 1716 | 0.3% |
| 11300 | 1716 | 0.3% |
| 11750 | 1709 | 0.3% |
| 12350 | 1702 | 0.3% |
| 12700 | 1701 | 0.3% |
| 11950 | 1694 | 0.3% |
| Other values (1091) | 541497 |
| Value | Count | Frequency (%) |
| 25 | 30 | < 0.1% |
| 50 | 44 | |
| 75 | 23 | < 0.1% |
| 100 | 33 | < 0.1% |
| 125 | 40 | |
| 150 | 45 | |
| 175 | 69 | |
| 200 | 54 | |
| 225 | 60 | |
| 250 | 83 |
| Value | Count | Frequency (%) |
| 182000 | 1 | < 0.1% |
| 178000 | 1 | < 0.1% |
| 176000 | 1 | < 0.1% |
| 172000 | 1 | < 0.1% |
| 170000 | 3 | |
| 166000 | 3 | |
| 164000 | 1 | < 0.1% |
| 163000 | 1 | < 0.1% |
| 162000 | 1 | < 0.1% |
| 161000 | 1 | < 0.1% |
sellingprice
Real number (ℝ)
High correlation
| Distinct | 1887 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 12 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13611.359 |
| Minimum | 1 |
|---|---|
| Maximum | 230000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.3 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1500 |
| Q1 | 6900 |
| median | 12100 |
| Q3 | 18200 |
| 95-th percentile | 30600 |
| Maximum | 230000 |
| Range | 229999 |
| Interquartile range (IQR) | 11300 |
Descriptive statistics
| Standard deviation | 9749.5016 |
|---|---|
| Coefficient of variation (CV) | 0.71627688 |
| Kurtosis | 11.114646 |
| Mean | 13611.359 |
| Median Absolute Deviation (MAD) | 5650 |
| Skewness | 1.9534444 |
| Sum | 7.6063676 × 109 |
| Variance | 95052782 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 11000 | 4453 | 0.8% |
| 12000 | 4450 | 0.8% |
| 13000 | 4334 | 0.8% |
| 10000 | 4029 | 0.7% |
| 14000 | 3899 | 0.7% |
| 11500 | 3876 | 0.7% |
| 12500 | 3714 | 0.7% |
| 9000 | 3689 | 0.7% |
| 10500 | 3540 | 0.6% |
| 15000 | 3386 | 0.6% |
| Other values (1877) | 519455 |
| Value | Count | Frequency (%) |
| 1 | 4 | < 0.1% |
| 100 | 19 | < 0.1% |
| 125 | 1 | < 0.1% |
| 150 | 21 | < 0.1% |
| 175 | 10 | < 0.1% |
| 200 | 196 | < 0.1% |
| 225 | 105 | < 0.1% |
| 250 | 281 | 0.1% |
| 275 | 124 | < 0.1% |
| 300 | 1282 |
| Value | Count | Frequency (%) |
| 230000 | 1 | |
| 183000 | 1 | |
| 173000 | 1 | |
| 171500 | 1 | |
| 169500 | 1 | |
| 169000 | 1 | |
| 167000 | 1 | |
| 165000 | 2 | |
| 163000 | 2 | |
| 161000 | 1 |
saledate
Text
| Distinct | 3766 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 12 |
| Missing (%) | < 0.1% |
| Memory size | 46.9 MiB |
Length
| Max length | 39 |
|---|---|
| Median length | 39 |
| Mean length | 38.998409 |
| Min length | 4 |
Unique
| Unique | 604 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Tue Dec 16 2014 12:30:00 GMT-0800 (PST) |
|---|---|
| 2nd row | Tue Dec 16 2014 12:30:00 GMT-0800 (PST) |
| 3rd row | Thu Jan 15 2015 04:30:00 GMT-0800 (PST) |
| 4th row | Thu Jan 29 2015 04:30:00 GMT-0800 (PST) |
| 5th row | Thu Dec 18 2014 12:30:00 GMT-0800 (PST) |
| Value | Count | Frequency (%) |
| 2015 | 505072 | 12.9% |
| pst | 395489 | 10.1% |
| gmt-0800 | 395489 | 10.1% |
| wed | 166069 | 4.2% |
| tue | 163950 | 4.2% |
| pdt | 163310 | 4.2% |
| gmt-0700 | 163310 | 4.2% |
| feb | 163053 | 4.2% |
| thu | 153750 | 3.9% |
| jan | 140815 | 3.6% |
| Other values (334) | 1501312 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4819691 | |
| 3352794 | ||
| T | 1435298 | 6.6% |
| : | 1117598 | 5.1% |
| 1 | 1061408 | 4.9% |
| 2 | 962306 | 4.4% |
| M | 673292 | 3.1% |
| 5 | 660463 | 3.0% |
| ) | 558799 | 2.6% |
| G | 558799 | 2.6% |
| Other values (30) | 6592838 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 21793286 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 4819691 | |
| 3352794 | ||
| T | 1435298 | 6.6% |
| : | 1117598 | 5.1% |
| 1 | 1061408 | 4.9% |
| 2 | 962306 | 4.4% |
| M | 673292 | 3.1% |
| 5 | 660463 | 3.0% |
| ) | 558799 | 2.6% |
| G | 558799 | 2.6% |
| Other values (30) | 6592838 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 21793286 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 4819691 | |
| 3352794 | ||
| T | 1435298 | 6.6% |
| : | 1117598 | 5.1% |
| 1 | 1061408 | 4.9% |
| 2 | 962306 | 4.4% |
| M | 673292 | 3.1% |
| 5 | 660463 | 3.0% |
| ) | 558799 | 2.6% |
| G | 558799 | 2.6% |
| Other values (30) | 6592838 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 21793286 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 4819691 | |
| 3352794 | ||
| T | 1435298 | 6.6% |
| : | 1117598 | 5.1% |
| 1 | 1061408 | 4.9% |
| 2 | 962306 | 4.4% |
| M | 673292 | 3.1% |
| 5 | 660463 | 3.0% |
| ) | 558799 | 2.6% |
| G | 558799 | 2.6% |
| Other values (30) | 6592838 |
Interactions
Correlations
| color | condition | interior | mmr | odometer | sellingprice | transmission | year | |
|---|---|---|---|---|---|---|---|---|
| color | 1.000 | 0.000 | 0.102 | 0.056 | 0.066 | 0.048 | 0.708 | 0.091 |
| condition | 0.000 | 1.000 | 0.008 | 0.427 | -0.404 | 0.480 | 0.000 | 0.387 |
| interior | 0.102 | 0.008 | 1.000 | 0.063 | 0.091 | 0.062 | 0.056 | 0.109 |
| mmr | 0.056 | 0.427 | 0.063 | 1.000 | -0.718 | 0.979 | 0.020 | 0.697 |
| odometer | 0.066 | -0.404 | 0.091 | -0.718 | 1.000 | -0.704 | 0.015 | -0.817 |
| sellingprice | 0.048 | 0.480 | 0.062 | 0.979 | -0.704 | 1.000 | 0.008 | 0.679 |
| transmission | 0.708 | 0.000 | 0.056 | 0.020 | 0.015 | 0.008 | 1.000 | 0.055 |
| year | 0.091 | 0.387 | 0.109 | 0.697 | -0.817 | 0.679 | 0.055 | 1.000 |
Missing values
Sample
| year | make | model | trim | body | transmission | vin | state | condition | odometer | color | interior | seller | mmr | sellingprice | saledate | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2015 | Kia | Sorento | LX | SUV | automatic | 5xyktca69fg566472 | ca | 5.0 | 16639.0 | white | black | kia motors america inc | 20500.0 | 21500.0 | Tue Dec 16 2014 12:30:00 GMT-0800 (PST) |
| 1 | 2015 | Kia | Sorento | LX | SUV | automatic | 5xyktca69fg561319 | ca | 5.0 | 9393.0 | white | beige | kia motors america inc | 20800.0 | 21500.0 | Tue Dec 16 2014 12:30:00 GMT-0800 (PST) |
| 2 | 2014 | BMW | 3 Series | 328i SULEV | Sedan | automatic | wba3c1c51ek116351 | ca | 45.0 | 1331.0 | gray | black | financial services remarketing (lease) | 31900.0 | 30000.0 | Thu Jan 15 2015 04:30:00 GMT-0800 (PST) |
| 3 | 2015 | Volvo | S60 | T5 | Sedan | automatic | yv1612tb4f1310987 | ca | 41.0 | 14282.0 | white | black | volvo na rep/world omni | 27500.0 | 27750.0 | Thu Jan 29 2015 04:30:00 GMT-0800 (PST) |
| 4 | 2014 | BMW | 6 Series Gran Coupe | 650i | Sedan | automatic | wba6b2c57ed129731 | ca | 43.0 | 2641.0 | gray | black | financial services remarketing (lease) | 66000.0 | 67000.0 | Thu Dec 18 2014 12:30:00 GMT-0800 (PST) |
| 5 | 2015 | Nissan | Altima | 2.5 S | Sedan | automatic | 1n4al3ap1fn326013 | ca | 1.0 | 5554.0 | gray | black | enterprise vehicle exchange / tra / rental / tulsa | 15350.0 | 10900.0 | Tue Dec 30 2014 12:00:00 GMT-0800 (PST) |
| 6 | 2014 | BMW | M5 | Base | Sedan | automatic | wbsfv9c51ed593089 | ca | 34.0 | 14943.0 | black | black | the hertz corporation | 69000.0 | 65000.0 | Wed Dec 17 2014 12:30:00 GMT-0800 (PST) |
| 7 | 2014 | Chevrolet | Cruze | 1LT | Sedan | automatic | 1g1pc5sb2e7128460 | ca | 2.0 | 28617.0 | black | black | enterprise vehicle exchange / tra / rental / tulsa | 11900.0 | 9800.0 | Tue Dec 16 2014 13:00:00 GMT-0800 (PST) |
| 8 | 2014 | Audi | A4 | 2.0T Premium Plus quattro | Sedan | automatic | wauffafl3en030343 | ca | 42.0 | 9557.0 | white | black | audi mission viejo | 32100.0 | 32250.0 | Thu Dec 18 2014 12:00:00 GMT-0800 (PST) |
| 9 | 2014 | Chevrolet | Camaro | LT | Convertible | automatic | 2g1fb3d37e9218789 | ca | 3.0 | 4809.0 | red | black | d/m auto sales inc | 26300.0 | 17500.0 | Tue Jan 20 2015 04:00:00 GMT-0800 (PST) |
| year | make | model | trim | body | transmission | vin | state | condition | odometer | color | interior | seller | mmr | sellingprice | saledate | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 558827 | 2014 | Jeep | Grand Cherokee | Laredo | SUV | automatic | 1c4rjfag0ec466276 | pa | 42.0 | 25180.0 | gray | black | hertz corporation/gdp | 26000.0 | 24500.0 | Tue Jul 07 2015 06:30:00 GMT-0700 (PDT) |
| 558828 | 2012 | Dodge | Grand Caravan | American Value Package | Minivan | automatic | 2c4rdgbg1cr349287 | ma | 37.0 | 97036.0 | silver | gray | ge fleet services for itself/servicer | 8300.0 | 7800.0 | Tue Jul 07 2015 06:30:00 GMT-0700 (PDT) |
| 558829 | 2012 | Hyundai | Elantra | Limited | Sedan | NaN | 5npdh4ae7ch106397 | pa | 4.0 | 66720.0 | gray | gray | champion mazda | 10250.0 | 10400.0 | Wed Jul 08 2015 07:30:00 GMT-0700 (PDT) |
| 558830 | 2012 | Nissan | Sentra | 2.0 SR | Sedan | NaN | 3n1ab6ap3cl622485 | tn | 26.0 | 35858.0 | white | gray | nissan-infiniti lt | 9950.0 | 10400.0 | Wed Jul 08 2015 17:15:00 GMT-0700 (PDT) |
| 558831 | 2011 | BMW | 5 Series | 528i | Sedan | automatic | wbafr1c53bc744672 | fl | 39.0 | 66403.0 | white | brown | lauderdale imports ltd bmw pembrok pines | 20300.0 | 22800.0 | Tue Jul 07 2015 06:15:00 GMT-0700 (PDT) |
| 558832 | 2015 | Kia | K900 | Luxury | Sedan | NaN | knalw4d4xf6019304 | in | 45.0 | 18255.0 | silver | black | avis corporation | 35300.0 | 33000.0 | Thu Jul 09 2015 07:00:00 GMT-0700 (PDT) |
| 558833 | 2012 | Ram | 2500 | Power Wagon | Crew Cab | automatic | 3c6td5et6cg112407 | wa | 5.0 | 54393.0 | white | black | i -5 uhlmann rv | 30200.0 | 30800.0 | Wed Jul 08 2015 09:30:00 GMT-0700 (PDT) |
| 558834 | 2012 | BMW | X5 | xDrive35d | SUV | automatic | 5uxzw0c58cl668465 | ca | 48.0 | 50561.0 | black | black | financial services remarketing (lease) | 29800.0 | 34000.0 | Wed Jul 08 2015 09:30:00 GMT-0700 (PDT) |
| 558835 | 2015 | Nissan | Altima | 2.5 S | sedan | automatic | 1n4al3ap0fc216050 | ga | 38.0 | 16658.0 | white | black | enterprise vehicle exchange / tra / rental / tulsa | 15100.0 | 11100.0 | Thu Jul 09 2015 06:45:00 GMT-0700 (PDT) |
| 558836 | 2014 | Ford | F-150 | XLT | SuperCrew | automatic | 1ftfw1et2eke87277 | ca | 34.0 | 15008.0 | gray | gray | ford motor credit company llc pd | 29600.0 | 26700.0 | Thu May 28 2015 05:30:00 GMT-0700 (PDT) |